Express Mail Label No. 
ER 121786982 US 



Date of Deposit: November 13, 2003 



10 



CLOCK GATED POWER SUPPLY NOISE COMPENSATION 
CROSS REFERENCE TO RELATED APPLICATION 

[0001] The present invention is related to U.S. Application Serial No. ' 



(Attorney Docket No. YOR20030363US1) entitled "BUILT IN SELF TEST FOR 
MEASURING TOTAL TIMING UNCERTAINTY IN A DIGITAL DATA PATH" to 
Robert L. Franch et al., filed coincident herewith and assigned to the assignee of the 
present invention. 

BACKGROUND OF THE INVENTION 



Field of the Invention 

[0002] The present invention is related to integrated circuit (IC) design systems and 
15 more particularly to characterizing timing uncertainties in ICs. 

Background Description 
[0003] Large high performance very large scale integration (VLSI) chips like 
microprocessors are synchronized to an internal clock. A typical internal clock is 

20 distributed throughout the chip, triggering chip registers to synchronously capture 

incoming data at the register latches and launch data from register latches. Ideally, each 
clock edge arrives simultaneously at each register every cycle and data arrives at the 
register latches sufficiently in advance of the respective clock edge, that all registers latch 
the correct data and simultaneously. Unfortunately, various chip differences can cause 

25 timing uncertainty, i.e., a variation in edge arrival to different registers. 

[0004] Such timing uncertainties can arise from data propagation variations and/or 
from clock arrival variations. Data propagation variations, for example, may result in a 
capturing latch that raindomly enters metastability or latches invalid data because the data 
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may or may not arrive at its input with sufficient set up time. Clock edge arrival 
variations include, for example, clock frequency fluctuations (jitter) and/or register to 
register clock edge arrival variations (skew). Both data path and clock edge arrival 
variations can arise from a number of sources including, for example, ambient chip 
5 conditions (e.g., local temiperature induced circuit variations or circuit heat sensitivities), 
power supply noise and chip process viariations. In particular, power supply noise can 
cause clock propagation delay variations through clock distribution buffers. Such clock 
propagation delay variations can cause skew variations from clock edge arrival time 
imcertainty at the registers. Typically, chip process variations include device length 
10 variations with different device lengths at different points on the same chip. So, a buffer 
at one end of a chip may be faster than another identical (by design) buffer at the opposite 
end of the same chip. Especially for clock distribution buffers, these process variations 
are another source of timing uncertainty. 

[0005] Furthermore, as technology features continue to shrink, power bus or Vdd 
IS noise is becoming the dominant contributor to total timing uncertainty. High speed 
circuit switching may cause large, nairow current spikes with very rapid rise and fall 
times, i.e., large dl/dt. In particular, each of those current spikes cause substantial voltage 
spikes in the on-chip supply voltage, even when power supply inductance (L) is 
minimized. Because the voltage across the inductor is V=LdI/dt, fliese supply line spikes 
20 also are referred to as L dl/dt noise or, simply, dl/dt noise. Since current switching can 
vary from cycle to cycle, the resulting noise varies from cycle to cycle. When the Vdd 
noise drops the on-chip supply voltage in response to a large switching event, it slows the 
entire chip, including both the clock path (clock buffers, local clock blocks, clock gating 
logic and etc.) as well as the data path logic (combinational logic gates, inverters and etc.) 
25 and may cause the chip to faiL When the noise dissipates and the on-chip supply later 
recovers, or even overshoots as the supply current falls; then, the circuits (buffers, gates 
and etc.) in these same paths speed up, returning to their nominal performance (with the 
normal stage delay) or even faster when the supply rises above nominal. If the supply 
rises too far above nominal, devices may be stressed beyond breaking to damage the chip 
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or, at the very least reduce chip reliability. The number of stages that can complete 
changes as the data path slows down or speeds up relative to the clock path. Currently, in 
particular, such switching noise is a significant component of total timing imcertainty, 
comparable to skew or jitter (which are themselves affected by switching noise) or chip 
S process variations. 

[0006] Thus, it would be useful to be able to identify dl/dt noise as it occurs and 
minimize how it affects circuit performance. 

SUMMARY OF THE INVENTION 

10 [0007] It is a purpose of the invention to improve integrated circuit (IC) chip design; 

[0008] It is another purpose of the invention to reduce dl/dt chip effects; 

[0009] It is yet another purpose of the invention to simplify chip design 
requirements for dl/dt effects; 

[0010] It is yet another purpose of the invention to sense the onset of dl/dt noise and 
1 S take steps to minimize how it affects circuit performance. 

[001 1] The present invention relates to a supply noise compensation circuit. The 
supply noise compensation circuit senses the onset of dl/dt noise events on a supply line 
and selectively gates ofG^forces on a chip clock to chip circuits. 

20 
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BRI£F DESCRIPTION OF THE DRAWINGS 

[0012] The foregoing and other objects, aspects and advantages will be better 
understood from the following detailed description of a preferred embodiment of the 
5 invention with reference to the drawings, in which: 

[0013] Figure 1 shows a block diagram of an example of a supply noise 
compensation circuit according to a preferred embodiment of the present invention; 

[0014] Figure 2 shows dl/dt noise in a supply line; 

[OOlS] Figure 3 shows a noise compensation circuit with a simple voltage sense 
10 sensing dl/dt spikes. 

DESCRIPTION OF PREFERRED EMBODIMENTS 

[0016] Turning now to the drawings and, more particularly, Figure 1 shows a block 
diagram of an example of a supply noise compensation circuit 100 according to a 

15 preferred embodiment of the present invention. A local clock block (LCB) or clock 
buffer 102 receives and re-drives a global chip clock 104 into 2 complementary local 
clocks 106, 108. One clock, a launch clock 106, is provided as an input to a delay line 
1 10 that is sensitive to supply voltage changes. The local clock, e.g., launch clock 106, 
enters the delay line 110 and, as it propagates through the delay line 1 10, the LCB 102 

20 and delay line 1 10 niimic data propagation delay through an actual data path, e.g., in a 
microprocessor. In particular, the launch clock propagating along the delay line 1 10 
reflects propagation delay variations resulting from switching or dl/dt noise on the circuit 
power supply (Vdd) line. Both the launch clock 106 and the second clock, a capture clock 
108, clock an N bit register 112. For example, N = 129 may be convenient for holding 3 

25 edges worth of clock edges. The N bit register 112 latches the state of the delay line 110 
as reflected at delay line taps 1 14. Thus, in this example the capture clock 108 captures 
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the forward position of the timing edges in the N bit register 1 12, Register contents are 
interrogated in compare circuit 116, which locates timing edges in the delay line 1 10 and 
identifies clock cycle to clock cycle delay changes, up or down. Thus, the delay line and 
register 1 12 act as a supply noise sensor. The output of the compare circuit 116 is an 
5 input to a clock skip circuit 118, which selectively throttles back on the clock, e.g., 
selectively skipping one or more clocks. 

[0017] Although in this example, the launch clock 106 drives the delay line 1 10, 
either clock, the launch or the capture clock, can drive the delay Ime 1 10. In this 
example, the rising edge of launch clock 106 and the falling edge of the capture clock 
10 108 (which latches the data) are coincident, and this edge of interest marks the end/start 
of the cycle boundary. It should be noted that the present invention is described herein 
with the registers (e.g., 1 12) being clocked by complementary clocks 106, 108. This is 
for example only and not intended as a limitation and the registers/latches may be pulsed 
latches or any suitable equivdent register/latch such as are well known in the art. 

15 [0018] Preferably, the delay difference.between each pau: of taps 1 14 is equivalent 
to one logic block delay. Typically, the total timing uncertainty metric is the number of 
combinational logic stages that complete in a cycle, sometimes referred to as the fan-out 
of 4 (F04) inverter count or F04 number. However, for the best time resolution the 
preferred delay between delay line taps 1 14 is the minimum delay for the particular 

20 technology, e.g., the delay for a single fan-out (FOl) inverter or, for example 20 

picoseconds (20ps). Preferably also; the delay line 1 10 is at least three clock periods long 
with nominal supply voltage, i.e., long enough that the start of one clock cycle, the 
leading clock edge, has not propagated through the delay line 110 before the start of 
second following cycle enters the delay line 110. Therefore, preferably, in the absence of 

25 noise the delay line 1 10 hais 3 edges passing through it. The N bit register 112 is clocked 
by both the launch clock 106 and the capture clock 108. Essentially, at the start of a 
global clock period, the launch clock 106 paisses a previously loaded N bits out of the 
register 1 12 as the leading edge begins traversing the delay line 110. At the end of each 
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global clock period, the capture clock 108 latches the state of the delay line taps 1 14 in 
the capture register 1 12, capturing the progress of the launch clock 106 edges through the 
delay line 110. The captured edges are at evenly spaced taps 1 14 in the absence of dl/dt 
noise other sources of timing uncertainty and such other sources may cause a variation of 
S a couple of taps 1 14. However/upon the occurrence of dl/dt noise, the edge locations 
may be much more closely spaced when the supply voltage spikes negative (below Vdd) 
because the delay line is slower and much more widely spaced when the supply is 
rebounding (above Vdd). 

[0019] The delay line 110 may be a series of suitably loaded inverters with delay 
line taps 1 14 being the inverter outputs, for example. As a result, the taps 1 14 alternate 
ones and zeros and the clock edges are located by a matched pair (either 2 zeros in a row, 
or 2 ones in a row) of adjacent delay line taps 1 14. The space between matching tap 
pairs, e.g., 60 inverter stages or 1.2 nanoseconds (1.2ns for 20ps inverters) between 
leading/rising clock edges for 3 clock edgeis traversing a 128 tap delay line 1 10, is a 
nieasure of logic propagation diiring a complete clock cycle. Thus, essentially, the same 
local clock block 102 both launches and captures the timing edges and^ because the local 
clock itself is the launched data, the clock takes a snapshot of itself in the capturing 
latches. 

[0020] Jitter from a phase locked loop (PLL), for example, may cause as much as a 
20 few occasional short cycles in a row. Although not required, the circuit timing can be 
adjusted so that the first edge (e.g., a leading or rising edge) is always captured in bit 
position 0 (register latch 0) and in the absence of jitter, the second (leading) edge is in bit 
60 and the third in bit position 120. Without clock timing uncertainty, the edges always 
fall in the same positions, 0, 60, 120. However, with an occasional short cycle the second 
25 edge (for the shorter cycle) shifts by one to bit 59; the third edge is captured in bit 1 19. 
With 2 consecutive short cycles, however, the second edge still shifts to bit 59, but the 
third edge shifts to bit 1 18. In each example, flie short clock is no more than a single 
delay shorterAonger than its neighbors. 
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[002 1 ] Figure 2 shows an example of a section of a supply noise characterization 
plot 120 showing dl/dt noise in a stipply line, which may be characterized as described in 
U.S. Application Serial No. • " (Attorney Docket No. YOR920030363US1) 
entitled "BUILT IN SELF TEST POR MEASURING TOTAL TIMING 
5 UNCERTAINTY IN A DIGITAlJ DATA PATH" to Robert L. Franch et al., filed 

coincident herewith, assigned to the assignee of the present invention and incorporated 
herein by reference. Upon the occurrence of a dl/dt noise spike, which typically lasts 
several clock cycles (e.g., anywhere from 10 - 50 cycles), the noise spike drives the 
supply to the delay line inverters 110 below nominal, reducing inverter switching speed 

10 and increasing inverter propagation delay, 2-3 register bits at about 2ns in this example 
120. By the end of the next cycle at about 3.6ns in this example, the delay line slows 
such that the preceding edges have propagated 10 fewer stages. Also, it should be noted 
that the present invention has application to measurement results as described in Franch 
et al. and such measurements may be used to sense the onset of a dl/dt noise event to 

15 mitigate the effects of such an event in accordance with the present invention. 

[0022] So for this example, instead of edges bemg captured at register bit locations 
1, 60 and 120, by the end of the first cycle, edges are captured edges are at register bit 
locations 1, 58 and 116 because the noise spike slows both edges. Further, by the end of 
the second cycle, captured edges are at register bit locations 1, 50 and 108. Similarly, as 

20 the current responsible for the noise spike begins to fall, the supply voltage spikes 

positive, accelerating edge travel through delay line 1 10 to the point where only 2 edges 
(in this example) are propagating through delay line 1 10. A preferred embodiment 
integrated circuit (IC) or IC with a supply noise compensation circuit (e.g., 100 in Figure 
1) senses the onset of dl/dt noise and responds by selectively skipping/forcing clock 

25 cycles to mitigate the dl/dt noise spikes and so, the extreme effects of dl/dt noise spikes. 

[0023] So referring again to Figure 1, when the compare 116 identifies at least a 2 
bit position reduction between cycles, for example, the compare 116 sends a signal to 
skip control circuit 1 18 to block the clock for at least the next cycle. Optionally, in 
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addition whenever, the compare 1 16 identifies at least a 2 bit position increase between 
cycles, for example, the compare 116 sends a signal to skip control circuit 1 18 to force 
the clock for at least the next cycle, i.e., preventing clock blocking for at least the next 
cycle. Furthermore, a single supply noise compensation circuit 100 may be located at the 
5 beginning of the chip clock tree, throttling the whole chip down/up in response to dl/dt 
noise or, supply noise compensation circuits 100 may be distributed throughout the chip 
clock tree, selectively throttling portions of the chip down/up in response to localized 
dl/dt noise. 

[0024] In particular, for a complex pipelined IC such as a microprocessor, where 
chip units or block of logic may use localized power up/down techniques, a local supply 
noise compensation circuit 100 may be provided with the chip units. Each local supply 
noise compensation circuit 100 may selectively delay powering up/down to better 
distribute instantaneous chip supply demands and, thereby, reduce dl/dt noise. Also, skip 
driver 118 may be selected to block/force cycles until the event has subsided partially or 
completely (e.g., an AND of the output of compare circuit 1 16 with the global clock), to 
block/force alternate cycles or any combination thereof 

[0025] Further, a simple voltage sense may be used to sense dl/dt spikes as shown in 
the noise compensation circuit 130 example of Figure 3, instead of delay line 1 10 and 
register 1 12 of the example 100 of Figure 1. In this example, supply voltage is averaged 
20 with in an RC filter 132 and compared in voltage compare 134. A skip timer 136, e.g., a 
simple D-type latch, is synchronized to global clock 138 and selectively block/passes the 
global clock in AND gate 138. When the instantaneous supply voltage to voltage 
compare circuit 134 is below the average voltage at RC filter 132 by a minimum 
instantaneous voltage difference (d), the voltage compare circuit 134 indicates the 
25 occurrence of dl/dt noise. Upon receipt of the indication, the skip timer 136 send a block 
signal synchronized to global clock 138 to the AND gate 140 that blocks at least the next 
clock cycle. The skip timer 136 prevents spurious local clocks firom occurring, e.g., firom 
a change in the voltage compare 134 mid cycle; 
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[0026] Thus, advantageously, a preferred embodiment IC can sense the onset of 
dl/dt noise and avoid the potentially disastrous effects on IC units and even mitigate the 
dl/dt noise spike itself. 

[0027] While the inyention has been described in terms of preferred embodiments, 
S those skilled in the art will recognize that the invention can be practiced with 
modification within the spirit and scope of the appended claims. 
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